Using Incomplete Trios to Boost Confidence in Family Based Association Studies

Authors

  • Varsha Dhankani
  • David L. Gibbs
  • Theo Knijnenburg
  • Roger Kramer
  • Joseph Vockley
  • John Niederhuber
  • Ilya Shmulevich
  • Brady Bernard
Abstract

Most currently available family-based association tests are designed only for nuclear families with complete genotypes for both parents and offspring. As whole genome sequencing becomes less expensive, genetic studies can collect data for more families and from larger family cohorts, with the goal of improving statistical power. However, many families with missing genotypes are excluded from family-based association tests, negating the benefits of large-scale sequencing data. Here, we present CIFBAT, a method that uses incomplete families in the Family Based Association Test (FBAT) to evaluate robustness against missing data. CIFBAT computes quantile intervals of the FBAT statistic by randomly choosing valid completions of incomplete family genotypes based on Mendelian inheritance rules. By considering all valid completions equally likely and computing quantile intervals over many randomized iterations, CIFBAT avoids assuming a homogeneous population structure or any particular missingness pattern in the data. Using simulated data, we show that the quantile intervals computed by CIFBAT are useful in validating the robustness of the FBAT statistic against missing data and in identifying genomic markers with higher precision. We also propose a novel set of candidate genomic markers for uterine-related abnormalities from an analysis of familial whole genome sequences, and provide validation for a previously established set of candidate markers for Type 1 diabetes. We provide a software package that incorporates TDT, robustTDT, FBAT, and CIFBAT. The data format proposed for the software uses half the memory of the standard FBAT format (PED) files, making it efficient for large-scale genome-wide association studies.
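The completion-and-quantile idea described in the abstract can be illustrated with a minimal Python sketch. This is not the authors' software: the function names, the 0/1/2 allele-count coding of a biallelic marker, the uniform draw over joint parental completions, and the use of a TDT-style chi-square as a stand-in for the FBAT statistic are all assumptions made for illustration.

```python
import random
from itertools import product

# Alleles a parent can transmit, keyed by genotype (count of allele A).
GAMETES = {0: (0,), 1: (0, 1), 2: (1,)}

def is_mendelian(father, mother, child):
    """True if the child genotype can arise from the two parental genotypes."""
    return any(a + b == child for a in GAMETES[father] for b in GAMETES[mother])

def valid_completions(father, mother, child):
    """List every (father, mother) pair consistent with the observed genotypes.

    A missing parental genotype is given as None; the child is assumed observed.
    """
    f_opts = (0, 1, 2) if father is None else (father,)
    m_opts = (0, 1, 2) if mother is None else (mother,)
    return [(f, m) for f, m in product(f_opts, m_opts) if is_mendelian(f, m, child)]

def tdt_statistic(trios):
    """TDT-style chi-square on complete trios (stand-in for the FBAT statistic)."""
    b = c = 0  # A alleles transmitted / not transmitted by heterozygous parents
    for father, mother, child in trios:
        n_het = (father == 1) + (mother == 1)
        n_hom_alt = (father == 2) + (mother == 2)
        transmitted = child - n_hom_alt  # A alleles that came from het parents
        b += transmitted
        c += n_het - transmitted
    return (b - c) ** 2 / (b + c) if b + c else 0.0

def quantile_interval(trios, n_iter=1000, lower=0.025, upper=0.975, seed=0):
    """Quantile interval of the statistic over random valid completions.

    Each valid completion of a trio is drawn with equal probability. A trio
    with no valid completion (a Mendelian error) would raise IndexError here.
    """
    rng = random.Random(seed)
    options = [valid_completions(f, m, c) for f, m, c in trios]
    stats = []
    for _ in range(n_iter):
        sample = [rng.choice(opts) + (c,) for opts, (_, _, c) in zip(options, trios)]
        stats.append(tdt_statistic(sample))
    stats.sort()
    return stats[int(lower * (n_iter - 1))], stats[int(upper * (n_iter - 1))]
```

For example, `quantile_interval([(1, None, 2), (None, None, 0), (1, 1, 2)])` returns a lower and upper bound on the statistic across the sampled completions. In this sketch a narrow interval means the missing parental genotypes have little influence on the result, while a wide interval suggests the signal at that marker depends on how the gaps are filled.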


Similar resources

Power evaluations for family-based tests of association with incomplete parental genotypes.

While a variety of methods have been developed to deal with incomplete parental genotype information in family-based association tests, sampling design issues with incomplete parental genotype data still have not received much attention. In this article, we present simulation studies with four genetic models and various sampling designs and evaluate power in family-based association studies. Ef...


Common FLG Mutation K4671X Not Associated with Atopic Dermatitis in Han Chinese in a Family Association Study

BACKGROUND Filaggrin gene (FLG) mutations have been identified as the cause of ichthyosis vulgaris (IV) and major predisposing factors for atopic dermatitis (AD). The relationship among AD, IV and FLG mutations has not been clarified yet. Mutations 3321delA and K4671X, two of the most common mutations in Chinese patients, were both statistically associated with AD in case-control studies. MAT...


Association between angiotensin-converting enzyme gene polymorphisms and diabetic nephropathy: case-control, haplotype, and family-based study in three European populations.

Angiotensin 1-converting enzyme gene (ACE) is a risk factor for diabetic nephropathy (DN) in patients with type 1 diabetes. The selection of this candidate gene is supported by cross-sectional and follow-up studies, but no convincing family-based studies are available. Recruited were 1057 patients (with DN: persistent albuminuria with or without renal failure) and 1127 control subjects (long-st...


Combining genetic association study designs: a GWAS case study

Genome-wide association studies (GWAS) explore the relationship between genome variability and disease susceptibility with either population- or family-based data. Here, we have evaluated the utility of combining population- and family-based statistical association tests and have proposed a method for reducing the burden of multiple testing. Unrelated singleton and parent-offspring trio cases a...


Imputation of parent-offspring trios and their effect on accuracy of genomic prediction using Bayesian method

The objective of this study was to evaluate the imputation accuracy of parent-offspring trios under different scenarios. By using simulated datasets, the performance of Bayesian LASSO in genomic prediction was also examined. The genome consisted of 5 chromosomes and each chromosome was set as 1 Morgan length. The number of SNPs per chromosome was 10000. One hundred QTLs were randomly distributed a...



Journal:
  • Frontiers in genetics

Volume 7, Issue

Pages -

Publication date 2016